SEQUENCE LISTING

(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Arch Development Corporation 

(B) STREET: 1101 East 58th Street 

(C) CITY: Chicago 

(D) STATE: Illinois 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP) : 60637 

(A) NAME: Michael S. Parmacek 

(B) STREET: 1225 E. 56th Street 

(C) CITY: Chicago 

(D) STATE: IL 

(E) COUNTRY: USA

(F) POSTAL CODE (ZIP) : 60637 

(A) NAME: Julian Solway 

(B) STREET: 746 Grove Street 

(C) CITY: Glencoe 

(D) STATE: IL 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP) : 60022 

(ii) TITLE OF INVENTION: PROMOTER FOR SMOOTH MUSCLE CELL EXPRESSION 
(iii) NUMBER OF SEQUENCES: 51 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO)

(v) CURRENT APPLICATION DATA: 

APPLICATION NUMBER: Unknown 
(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/726,807 

(B) FILING DATE: 07-OCT-1996 



(2) INFORMATION FOR SEQ ID NO : 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1419 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 



GAATTCAGGA CGTAATCAGT GGCTGGAAAG CAAGAGCTCT AGAGGAGCTC CAGCTTATTA 60

TGACCCTTCC TTCAGATGCC ACAAGGAGGT GCTGGAGTTC TATGCACCAA TAGCTTAAAC 120

CAGCCAGGCT GGCTGTAGTG GATTGAGCGT CTGAGGCTGC ACCTCTCTGG CCTGCAGCCA 180 

GTTCTGGGTG AGACTGACCC TGCCTGAGGG TTCTCTCCTT CCCTCTCTCT ACTCCTTTCT 240

CCCTCTCCCT CTCCCTCTCT CTGTTTCCTG AGGTTTCCAG GATTGGGGAT GGGACTCAGA 300

GACACCACTA AAGCCTTACC TTTTAAGAAG TTGCATTCAG TGAGTGTGTG AGACATAGCA 360

CAGATAGGGG CAGAGGAGAG CTGGTTCTGT CTCCACTGTG TTTGGTCTTG GGTACTGAAC 420

TCAGACCATC AGGTGTGATA GCAGTTGTCT TTAACCCTAA CCCTGAGCCT GTCTCACCTG 480

TCCCTTCCCA AGACCACTGA AGCTAGGTGC AAGATAAGTG GGGACCCTTT CTGAGGTGGT 540

AGGATCTTTC ACGATAAGGA CTATTTTGAA GGGAGGGAGG GTGACACTGT CCTAGTCCTC 600

TTACCCTAGT GTCTCCAGCC TTGCCAGGCC TTAAACATCC GCCCATTGTC ACCGCTCTAG 660

AAGGGGCCAG GGTTGACTTG CTGCTAAACA AGGCACTCCC TAGAGAAGCA CCCGCTAGAA 720

GCATACCATA CCTGTGGGCA GGATGACCCA TGTTCTGCCA CGCACTTGGT AGCCTTGGAA 780

AGGCCACTTT GAACCTCAAT TTTCTCAACT GTTAAATGGG GTGGTAACTG CTATCTCATA 840

ATAAAGGGGA ACGTGAAAGG AAGGCGTTTG CATAGTGCCT GGTTGTGCAG CCAGGCTGCA 900 

GTCAAGACTA GTTCCCACCA ACTCGATTTT AAAGCCTTGC AAGAAGGTGG CTTGTTTGTC 960

CCTTGCAGGT TCCTTTGTCG GGCCAAACTC TAGAATGCCT CCCCCTTTCT TTCTCATTGA 1020

AGAGCAGACC CAAGTCCGGG TAACAAGGAA GGGTTTCAGG GTCCTGCCCA TAAAAGGTTT 1080

TTCCCGGCCG CCCTCAGCAC CGCCCCGCCC CGACCCCCGC AGCATCTCCA AAGCATGCAG 1140

AGAATGTCTC CGGCTGCCCC CGACAGACTG CTCCAACTTG GTGTCTTTCC CCAAATATGG 1200

AGCCTGTGTG GAGTGAGTGG GGCGGCCCGG GGTGGTGAGC CAAGCAGACT TCCATGGGCA 1260

GGGAGGGGCG CCAGCGGACG GCAGAGGGGT GACATCACTG CCTAGGCGGC CTTTAAACCC 1320

CTCACCCAGC CGGCGCCCCA GCCCGTCTGC CCCAGCCCAG ACACCGAAGC TACTCTCCTT 1380

CCAGTCCACA AACGACCAAG CCTTGTAAGT GCAAGTCAT 1419



(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 991 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ix) FEATURE:
(A) NAME/KEY: CDS

(B) LOCATION: 38..218

(ix) FEATURE:

(A) NAME/KEY: CDS
(B) LOCATION: 322..500

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 866..967



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



CTTTTCTCCA CACTCTATAC TTTAGCTCTG CCTCAAC ATG GCC AAC AAG GGT CCA 55

Met Ala Asn Lys Gly Pro
1 5

TCC TAC GGC ATG AGC CGA GAA GTG CAG TCC AAA ATT GAG AAG AAG TAT 103

Ser Tyr Gly Met Ser Arg Glu Val Gln Ser Lys Ile Glu Lys Lys Tyr
10 15 20

GAC GAG GAG CTG GAG GAG CGA CTA GTG GAG TGG ATT GTA GTG CAG TGT 151
Asp Glu Glu Leu Glu Glu Arg Leu Val Glu Trp Ile Val Val Gln Cys
25 30 35

GGC CCT GAT GTA GGC CGC CCA GAT CGT GGG CGC CTG GGC TTC CAG GTG 199

Gly Pro Asp Val Gly Arg Pro Asp Arg Gly Arg Leu Gly Phe Gln Val
40 45 50

TGG CTG AAG AAT GGT GTG G TGAGTAACCC TTGCGAAGGG AATCTAGGGA 248

Trp Leu Lys Asn Gly Val
55 60

TGTGTATGCC GCCCTACAAA CTGTGAGACA GACTCCCTGA GCTGAGTGTT CAGTTGTGTT 308

CTGTACCTGG CAG ATT CTG AGC AAA TTG GTG AAC AGC CTG TAT CCT GAG 357

Ile Leu Ser Lys Leu Val Asn Ser Leu Tyr Pro Glu
1 5 10

GGA TCG AAG CCA GTG AAG GTG CCT GAG AAC CCA CCC TCC ATG GTC TTT 405

Gly Ser Lys Pro Val Lys Val Pro Glu Asn Pro Pro Ser Met Val Phe
15 20 25

AAG CAG ATG GAA CAG GTG GCT CAA TTC TTG AAG GCA GCT GAA GAT TAT 453
Lys Gln Met Glu Gln Val Ala Gln Phe Leu Lys Ala Ala Glu Asp Tyr
30 35 40

GGA GTC ATC AAG ACT GAC ATG TTC CAG ACT GTT GAC CTC TAT GAA GG 500
Gly Val Ile Lys Thr Asp Met Phe Gln Thr Val Asp Leu Tyr Glu
45 50 55

TATAAGGAAA AAAGGGCTGG AGCCAGTGGG CGAGTGGAGA GCAAGATTAT CAGTCAAGGA 560

GAAGGAATAT CAAAAGCCAC AACCAGCTCT GTTGATGTGT TCATAGCAGG AATGGGATAT 620

GCCAAGAGAA CACATAGCAA GGGGACCAGC TTGGTGGTAC AGCATTTCCT TCTGGGTACA 680

AGGGCCTGTT TTGGATCCTA GAATATCAAA TATATACCAC ACCATACTCA CTAGGGTTTA 740

GAATATGGTC TCTTGAACCC TCTTGATTTG GTGCCACTTG CTCCTTGGTT GGACCATTTT 800

TGAAGCTGGG CAGGTATTGC CTATATGGTC CTGAAATTAG CTCCCTGGCC ACTCTTCTCA 860

TAGGT AAG GAT ATG GCA GCA GTG CAG AGG ACT CTA ATG GCT TTG GGC 907
Lys Asp Met Ala Ala Val Gln Arg Thr Leu Met Ala Leu Gly
1 5 10

AGT TTG GCT GTG ACC AAA AAC GAT GGA AAC TAC CGT GGA GAT CCC AAC 955
Ser Leu Ala Val Thr Lys Asn Asp Gly Asn Tyr Arg Gly Asp Pro Asn
15 20 25 30

TGG TTT ATG AAG TATGTGTCCA CTGGGTCTCT CTGT 991
Trp Phe Met Lys



(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

Met Ala Asn Lys Gly Pro Ser Tyr Gly Met Ser Arg Glu Val Gln Ser
1 5 10 15

Lys Ile Glu Lys Lys Tyr Asp Glu Glu Leu Glu Glu Arg Leu Val Glu
20 25 30

Trp Ile Val Val Gln Cys Gly Pro Asp Val Gly Arg Pro Asp Arg Gly
35 40 45

Arg Leu Gly Phe Gln Val Trp Leu Lys Asn Gly Val
50 55 60



(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 59 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Ile Leu Ser Lys Leu Val Asn Ser Leu Tyr Pro Glu Gly Ser Lys Pro
1 5 10 15

Val Lys Val Pro Glu Asn Pro Pro Ser Met Val Phe Lys Gln Met Glu
20 25 30

Gln Val Ala Gln Phe Leu Lys Ala Ala Glu Asp Tyr Gly Val Ile Lys
35 40 45

Thr Asp Met Phe Gln Thr Val Asp Leu Tyr Glu
50 55 



(2) INFORMATION FOR SEQ ID NO : 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Lys Asp Met Ala Ala Val Gln Arg Thr Leu Met Ala Leu Gly Ser Leu
1 5 10 15 

Ala Val Thr Lys Asn Asp Gly Asn Tyr Arg Gly Asp Pro Asn Trp Phe 
20 25 30 

Met Lys 



(2) INFORMATION FOR SEQ ID NO : 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 575 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 28..169

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

ACTTACCCTG GTTCCTTTTC TTCTAGG AAA GCC CAG GAG CAT AAG AGG GAC 51

Lys Ala Gln Glu His Lys Arg Asp
1 5

TTC ACA GAC AGC CAA CTG CAG GAG GGG AAG CAC GTC ATT GGC CTT CAA 99

Phe Thr Asp Ser Gln Leu Gln Glu Gly Lys His Val Ile Gly Leu Gln
10 15 20

ATG GGC AGC AAC AGA GGA GCC TCG CAG GCT GGC ATG ACA GGC TAT GGG 147

Met Gly Ser Asn Arg Gly Ala Ser Gln Ala Gly Met Thr Gly Tyr Gly
25 30 35 40

CGA CCC CGG CAG ATC ATC AGT T AGAAAGGGAA GGCCAGCCCT GAGCTGCAGC 199
Arg Pro Arg Gln Ile Ile Ser
45

ATCCTGCTTA GCCTGCCTCA CAAATGCCTA TGTAGGTTCT TAGCCCTGAC AGCTCTGAGG 259

TGTCACTGGG CAAAGATGAC TGCACATGGG CAGCTCCCAC CTATCCTTAG CCTCAGCCCA 319

GCATCTTACC CCAGAGCCAC CACTGCCCTG GCCCCTGTTC CCAGCTGTAC CCCCACCTCT 379

ACTGTTCCTC TCATCCTGGA GTAAGCAGGG AGAAGTGGGC TGGGGTAGCT GGCTGTAGGC 439

CAGCCCACTG TCCTTGATAT CGAATGTCCT TTGAAGGAGA CCCAGCCCAG CCTCTACATC 499

TTTTCCTGGA ATATGTTTTT GGGTTGAAAT TCAAAAAGGA AAAAAGAAAA ATATATAAAT 559

ATATATATAT ATATAC 575



(2) INFORMATION FOR SEQ ID NO : 7: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 47 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

Lys Ala Gln Glu His Lys Arg Asp Phe Thr Asp Ser Gln Leu Gln Glu
1 5 10 15

Gly Lys His Val Ile Gly Leu Gln Met Gly Ser Asn Arg Gly Ala Ser
20 25 30

Gln Ala Gly Met Thr Gly Tyr Gly Arg Pro Arg Gln Ile Ile Ser
35 40 45



(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1102 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ix) FEATURE:

(A) NAME/KEY: CDS
(B) LOCATION: 77..681

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

GCCCGTCTGC CCCAGCCCAG ACACCGAAGC TACTCTCCTT CCAGTCCACA AACGACCAAG 60

CCTTCTCTGC CTCAAC ATG GCC AAC AAG GGT CCA TCC TAC GGC ATG AGC 109

Met Ala Asn Lys Gly Pro Ser Tyr Gly Met Ser
1 5 10

CGA GAA GTG CAG TCC AAA ATT GAG AAG AAG TAT GAC GAG GAG CTG GAG 157
Arg Glu Val Gln Ser Lys Ile Glu Lys Lys Tyr Asp Glu Glu Leu Glu
15 20 25

GAG CGA CTA GTG GAG TGG ATT GTA GTG CAG TGT GGC CCT GAT GTA GGC 205

Glu Arg Leu Val Glu Trp Ile Val Val Gln Cys Gly Pro Asp Val Gly
30 35 40

CGC CCA GAT CGT GGG CGC CTG GGC TTC CAG GTG TGG CTG AAG AAT GGT 253

Arg Pro Asp Arg Gly Arg Leu Gly Phe Gln Val Trp Leu Lys Asn Gly
45 50 55

GTG ATT CTG AGC AAA TTG GTG AAC AGC CTG TAT CCT GAG GGA TCG AAG 301

Val Ile Leu Ser Lys Leu Val Asn Ser Leu Tyr Pro Glu Gly Ser Lys
60 65 70 75

CCA GTG AAG GTG CCT GAG AAC CCA CCC TCC ATG GTC TTT AAG CAG ATG 349

Pro Val Lys Val Pro Glu Asn Pro Pro Ser Met Val Phe Lys Gln Met
80 85 90

GAA CAG GTG GCT CAA TTC TTG AAG GCA GCT GAA GAT TAT GGA GTC ATC 397

Glu Gln Val Ala Gln Phe Leu Lys Ala Ala Glu Asp Tyr Gly Val Ile
95 100 105

AAG ACT GAC ATG TTC CAG ACT GTT GAC CTC TAT GAA GGT AAG GAT ATG 445

Lys Thr Asp Met Phe Gln Thr Val Asp Leu Tyr Glu Gly Lys Asp Met
110 115 120

GCA GCA GTG CAG AGG ACT CTA ATG GCT TTG GGC AGT TTG GCT GTG ACC 493

Ala Ala Val Gln Arg Thr Leu Met Ala Leu Gly Ser Leu Ala Val Thr
125 130 135

AAA AAC GAT GGA AAC TAC CGT GGA GAT CCC AAC TGG TTT ATG AAG AAA 541

Lys Asn Asp Gly Asn Tyr Arg Gly Asp Pro Asn Trp Phe Met Lys Lys
140 145 150 155

GCC CAG GAG CAT AAG AGG GAC TTC ACA GAC AGC CAA CTG CAG GAG GGG 589

Ala Gln Glu His Lys Arg Asp Phe Thr Asp Ser Gln Leu Gln Glu Gly
160 165 170

AAG CAC GTC ATT GGC CTT CAA ATG GGC AGC AAC AGA GGA GCC TCG CAG 637

Lys His Val Ile Gly Leu Gln Met Gly Ser Asn Arg Gly Ala Ser Gln
175 180 185

GCT GGC ATG ACA GGC TAT GGG CGA CCC CGG CAG ATC ATC AGT TA 681
Ala Gly Met Thr Gly Tyr Gly Arg Pro Arg Gln Ile Ile Ser
190 195 200

GAAAGGGAAG GCCAGCCCTG AGCTGCAGCA TCCTGCTTAG CCTGCCTCAC AAATGCCTAT 741

GTAGGTTCTT AGCCCTGACA GCTCTGAGGT GTCACTGGGC AAAGATGACT GCACATGGGC 801

AGCTCCCACC TATCCTTAGC CTCAGCCCAG CATCTTACCC CAGAGCCACC ACTGCCCTGG 861

CCCCTGTTCC CAGCTGTACC CCCACCTCTA CTGTTCCTCT CATCCTGGAG TAAGCAGGGA 921

GAAGTGGGCT GGGGTAGCTG GCTGTAGGCC AGCCCACTGT CCTTGATATC GAATGTCCTT 981

TGAAGGAGAC CCAGCCCAGC CTCTACATCT TTTCCTGGAA TATGTTTTTG GGTTGAAATT 1041

CAAAAAGGAA AAAAGAAAAA TATATAAATA TATATATATA CAAAAAAAAA AAAAAAAAAA 1101
A 1102



(2) INFORMATION FOR SEQ ID NO : 9: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 201 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

Met Ala Asn Lys Gly Pro Ser Tyr Gly Met Ser Arg Glu Val Gln Ser
1 5 10 15

Lys Ile Glu Lys Lys Tyr Asp Glu Glu Leu Glu Glu Arg Leu Val Glu
20 25 30

Trp Ile Val Val Gln Cys Gly Pro Asp Val Gly Arg Pro Asp Arg Gly
35 40 45

Arg Leu Gly Phe Gln Val Trp Leu Lys Asn Gly Val Ile Leu Ser Lys
50 55 60

Leu Val Asn Ser Leu Tyr Pro Glu Gly Ser Lys Pro Val Lys Val Pro
65 70 75 80

Glu Asn Pro Pro Ser Met Val Phe Lys Gln Met Glu Gln Val Ala Gln
85 90 95

Phe Leu Lys Ala Ala Glu Asp Tyr Gly Val Ile Lys Thr Asp Met Phe
100 105 110

Gln Thr Val Asp Leu Tyr Glu Gly Lys Asp Met Ala Ala Val Gln Arg
115 120 125

Thr Leu Met Ala Leu Gly Ser Leu Ala Val Thr Lys Asn Asp Gly Asn
130 135 140

Tyr Arg Gly Asp Pro Asn Trp Phe Met Lys Lys Ala Gln Glu His Lys
145 150 155 160

Arg Asp Phe Thr Asp Ser Gln Leu Gln Glu Gly Lys His Val Ile Gly
165 170 175

Leu Gln Met Gly Ser Asn Arg Gly Ala Ser Gln Ala Gly Met Thr Gly
180 185 190

Tyr Gly Arg Pro Arg Gln Ile Ile Ser
195 200 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
ATCGAATTCC GCTACTCTCC TTCCAGCCCA CAAACGACCA AGC



(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
ATCAAGCTTG GTGGGAGCTG CCCATGTGCA GTC 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
TGCCGTAGGA TGGACCCTTG TTGGC 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE:

(A) NAME/KEY: modified_base

(B) LOCATION: 1

(D) OTHER INFORMATION: /mod_base= OTHER
/note= "Y = C or T/U"

(ix) FEATURE:

(A) NAME/KEY: modified_base

(B) LOCATION: 4

(D) OTHER INFORMATION: /mod_base= OTHER
/note= "W = A or T/U"

(ix) FEATURE:

(A) NAME/KEY: modified_base

(B) LOCATION: 10

(D) OTHER INFORMATION: /mod_base= OTHER
/note= "R = A or G"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:
YTAWAAATAR



(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:
TTTAAAATCG 



(2) INFORMATION FOR SEQ ID NO : 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:
TTCAAAATAG 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

Cys Cys Cys Met Asn Ser Ser Ser 
1 5 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:

Lys Arg Gly Gly Cys Lys Arg Arg Lys 
1 5 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 9 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:

Thr Lys Asn Asn Gly Asn Ala Ala Lys 
1 5 



(2) INFORMATION FOR SEQ ID NO: 19: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 8 amino acids 
(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Met Ile Arg Ile Cys Arg Lys Lys
1 5 



(2) INFORMATION FOR SEQ ID NO: 20:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 381 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

AGTCAAGACT AGTTCCCACC AACTCGATTT TAAAGCCTTG CAAGAAGGTG GCTTGTTTGT 60

CCCTTGCAGG TTCCTTTGTC GGGCCAAACT CTAGAATGCC TCCCCCTTTC TTTCTCATTG 120

AAGAGCAGAC CCAAGTCCGG GTAACAAGGA AGGGTTTCAG GGTCCTGCCC ATAAAAGGTT 180

TTTCCCGGCC GCCCTCAGCA CCGCCCCGCC CCGACCCCCG CAGCATCTCC AAAGCATGCA 240

GAGAATGTCT CCGGCTGCCC CCGACAGACT GCTCCAACTT GGTGTCTTTC CCCAAATATG 300

GAGCCTGTGT GGAGTGAGTG GGGCGGCCCG GGGTGGTGAG CCAAGCAGAC TTCCATGGGC 360
AGGGAGGGGC GCCAGCGGAC G 381



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
AAGGAAGGGT TTCAGGGTCC TGCCCATAAA AGGTTTTTCC CGGCCGC 



(2) INFORMATION FOR SEQ ID NO: 22:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 47 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
AAGGAAGGGT TTCAGGGTCC TGCCCATAGA TCTTTTTTCC CGGCCGC 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
CCGCCCTCAG CACCGCCCCG CCCCGAGGCC CGCAGCATGT CCG 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24:
CCGCCCTCAG CACCGCGGAT CCCCGACCCC CGCAGCATCT CCG 



(2) INFORMATION FOR SEQ ID NO : 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 25: 
CTCCAAAGCA TGCAGAGAAT GTCTCCGGCT GCCCCCG 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
CTCGGATCCA TGCTAGCAAT GAATTCGGCT GCCCCCG 



(2) INFORMATION FOR SEQ ID NO : 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
TCCAACTTGG TGTCTTTCCC CAAATATGGA GCCTGTGTGG AGTG 



(2) INFORMATION FOR SEQ ID NO : 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:
TCCAACTTGG TGTCTTTCCC CAAGGATCCA GCCTGTGTGG AGTG 



(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
TCCAACTTGG TGTCTTTCCC CGGATATGGA GCCTGTGTGG AGTG 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 30: 
TCCAACTTGG TGTCTTTCCC CAAATTAGGA GCCTGTGTGG AGTG 



(2) INFORMATION FOR SEQ ID NO : 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
GGGCAGGGAG GGGCGCCAGC G 



(2) INFORMATION FOR SEQ ID NO : 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32:
GGGCAGGTAC CGAATTCAGC G 



(2) INFORMATION FOR SEQ ID NO : 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33:
GGACGGCAGA GGGGTGACAT CACTGCCTAG GCGGCCG 



(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34:
GGACGGCAGA GGGGATCCAT GCCTGCCTAG GCGGCCG 



(2) INFORMATION FOR SEQ ID NO : 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35:
GGACGGCAGA GGGGATCCAT CACTGCCTAG GCGGCCG 



(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36:
CTGGCTAAAG GGGCGGGGCT TGGCCAGCC 



(2) INFORMATION FOR SEQ ID NO : 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
CTCCCATTTC CATGACGTCA TGGTTA 



(2) INFORMATION FOR SEQ ID NO : 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
AAGGAAGGGT TTCAGGGTCC TGCCCATAGA TCTTTTTTCC CGGCCGC 



(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 39: 
CCGCCCTCAG CACCGCGGAT CCCCGACCCC CGCAGCATCT CCG 



(2) INFORMATION FOR SEQ ID NO : 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40:
CTCGGATCCA TGCTAGCAAT GAATTCGGCT GCCCCCG 



(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41:
TCCAACTTGG TGTCTTTCCC CAAGGATCCA GCCTGTGTGG AGTG 



(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
TCCAACTTGG TGTCTTTCCC CGGATATGGA GCCTGTGTGG AGTG 



(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43:
TCCAACTTGG TGTCTTTCCC CAAATTAGGA GCCTGTGTGG AGTG 



(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44:
GGGCAGGTAC CGAATTCAGC G 



(2) INFORMATION FOR SEQ ID NO : 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45:
GGACGGCAGA GGGGATCCAT GCCTGCCTAG GCGGCCG 



(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46:
GGACGGCAGA GGGGATCCAT CACTGCCTAG GCGGCCG 



(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: modified_base

(B) LOCATION: 3..8
(D) OTHER INFORMATION : /mod_bas 

/note= "W = A or T" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47:



CCWWWWWWCC



(2) INFORMATION FOR SEQ ID NO: 48:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 45 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 
CTCCAACTTG GTGTCTTTCC CCGGATATGG AGCCTGTGTG GAGTG 



(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 49: 
CTCCAACTTG GTGTCTTTCC CCAAATTAGG AGCCTGTGTG GAGTG 



(2) INFORMATION FOR SEQ ID NO : 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
CCAAATATGG 

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 
CCATATATGG 



